Sangamon County
- North America > United States > Ohio (0.05)
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (13 more...)
- Research Report (0.68)
- Workflow (0.46)
- Government (0.67)
- Transportation > Passenger (0.46)
- Transportation > Air (0.46)
Unsupervised decoding of encoded reasoning using language model interpretability
As large language models become increasingly capable, there is growing concern that they may develop reasoning processes that are encoded or hidden from human oversight. To investigate whether current interpretability techniques can penetrate such encoded reasoning, we construct a controlled testbed by fine-tuning a reasoning model (DeepSeek-R1-Distill-Llama-70B) to perform chain-of-thought reasoning in ROT-13 encryption while maintaining intelligible English outputs. We evaluate mechanistic interpretability methods--in particular, logit lens analysis--on their ability to decode the model's hidden reasoning process using only internal activations. We show that logit lens can effectively translate encoded reasoning, with accuracy peaking in intermediate-to-late layers. Finally, we develop a fully unsupervised decoding pipeline that combines logit lens with automated paraphrasing, achieving substantial accuracy in reconstructing complete reasoning transcripts from internal model representations. These findings suggest that current mechanistic interpretability techniques may be more robust to simple forms of encoded reasoning than previously understood. Our work provides an initial framework for evaluating interpretability methods against models that reason in non-human-readable formats, contributing to the broader challenge of maintaining oversight over increasingly capable AI systems.
- North America > United States > Illinois > Sangamon County > Springfield (0.14)
- North America > United States > Illinois > Cook County > Chicago (0.07)
- North America > United States > California > Sacramento County > Sacramento (0.05)
- (22 more...)
Progressive Image Restoration via Text-Conditioned Video Generation
Kang, Peng, Wang, Xijun, Yuan, Yu
Recent text-to-video models have demonstrated strong temporal generation capabilities, yet their potential for image restoration remains underexplored. In this work, we repurpose CogVideo for progressive visual restoration tasks by fine-tuning it to generate restoration trajectories rather than natural video motion. Specifically, we construct synthetic datasets for super-resolution, deblurring, and low-light enhancement, where each sample depicts a gradual transition from degraded to clean frames. Two prompting strategies are compared: a uniform text prompt shared across all samples, and a scene-specific prompting scheme generated via LLaVA multi-modal LLM and refined with ChatGPT. Our fine-tuned model learns to associate temporal progression with restoration quality, producing sequences that improve perceptual metrics such as PSNR, SSIM, and LPIPS across frames. Extensive experiments show that CogVideo effectively restores spatial detail and illumination consistency while maintaining temporal coherence. Moreover, the model generalizes to real-world scenarios on the ReLoBlur dataset without additional training, demonstrating strong zero-shot robustness and interpretability through temporal restoration.
- North America > United States > Illinois > Sangamon County > Springfield (0.14)
- North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
- North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- North America > United States > Ohio (0.04)
- Europe > Netherlands (0.04)
- (6 more...)
- Law (1.00)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
- Education (0.93)
- (2 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Data Science > Data Mining (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)
SynthTools: A Framework for Scaling Synthetic Tools for Agent Development
Castellani, Tommaso, Ye, Naimeng, Mittal, Daksh, Yen, Thomson, Namkoong, Hongseok
AI agents increasingly rely on external tools to solve complex, long-horizon tasks. Advancing such agents requires reproducible evaluation and large-scale training in controllable, diverse, and realistic tool-use environments. However, real-world APIs are limited in availability, domain coverage, and stability, often requiring access keys and imposing rate limits, which render them impractical for stable evaluation or scalable training. To address these challenges, we introduce SynthTools, a flexible and scalable framework for generating synthetic tool ecosystems. Our framework consists of three core components: Tool Generation for automatic and scalable creation of diverse tools, Tool Simulation to emulate realistic tool behaviors, and Tool Audit to ensure correctness and consistency of tool simulation. To illustrate its scalability, we show that SynthTools can readily produce toolsets that span twice as many domains and twice as many tools per domain as prior work. Furthermore, the tool simulation and tool audit components demonstrate strong reliability, achieving $94\%$ and $99\%$ accuracy respectively. Finally, we construct downstream tasks from the generated tools that even state-of-the-art models struggle to complete. By enabling scalable, diverse, and reliable tool ecosystems, SynthTools provides a practical path toward large-scale training and stable evaluation of tool-use agents. Our code is available at https://github.com/namkoong-lab/SynthTools.
- Europe > Austria > Vienna (0.14)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Wisconsin > Milwaukee County > Milwaukee (0.04)
- (9 more...)
- Transportation (1.00)
- Information Technology (1.00)
- Health & Medicine (1.00)
- (3 more...)
Machine Learning Enabled Early Warning System For Financial Distress Using Real-Time Digital Signals
pant, Laxmi, Reza, Syed Ali, Rahman, Md Khalilor, Rahman, MD Saifur, Sharmin, Shamima, Mithu, Md Fazlul Huq, Hasnain, Kazi Nehal, Farabi, Adnan, khanom, Mahamuda, Kabir, Raisul
International Journal of Applied Mathematics Volume 38 No. 5 s, 2025 ISSN: 1311 - 1728 (printed version); ISSN: 1314 - 8060 (on - line version) Received: August 0 7, 2025 550 Abstract The growing instability of both global and domestic economic environments has increased the risk of financial distress at the household level. However, traditional econometric models often rely on delayed and aggregated data, limiting their effectiveness. This study introduces a machine learning - based early warning system that utilizes real - time digital and macroeconomic signals to identify financial distress in near real - time. Using a panel dataset of 750 households tracked over three monitoring rounds spa nning 13 months, the framework combines socioeconomic attributes, macroeconomic indicators (such as GDP growth, inflation, and foreign exchange fluctuations), and digital economy measures (including ICT demand and market volatility). Through data preproces sing and feature engineering, we introduce lagged variables, volatility measures, and interaction terms to capture both gradual and sudden changes in financial stability. We benchmark baseline classifiers, such as logistic regression and decision trees, ag ainst advanced ensemble models including random forests, XGBoost, and LightGBM. Our results indicate that the engineered features from the digital economy significantly enhance predictive accuracy. The system performs reliably for both binary distress dete ction and multi - class severity classification, with SHAP - based explanations identifying inflation volatility and ICT demand as key predictors. Crucially, the framework is International Journal of Applied Mathematics Volume 38 No. 5 s, 2025 ISSN: 1311 - 1728 (printed version); ISSN: 1314 - 8060 (on - line version) Received: August 0 7, 2025 551 By implementing machine learning in a transparent and interpretable manner, this study demonstrates the feasibility and impact of providing near - real - time early warnings of financial distress. This offers actionable insights that can strengthen household resilience and guide preemptive intervention strategies. Keywords: Financial Distress, Early Warning Systems, Machine Learning, Digital Economy, Temporal Classification, Explainable AI 1. Introduction 1.1 Background and Motivation The prediction of financial distress has long been recognized as a critical element for ensuring economic resilience and mitigating systemic risk across households, firms, and national economies.
- Asia > China (0.04)
- North America > United States > Pennsylvania > Erie County > Erie (0.04)
- North America > United States > New York > Suffolk County > Stony Brook (0.04)
- (4 more...)
- Information Technology > Security & Privacy (1.00)
- Banking & Finance > Trading (1.00)
- Banking & Finance > Economy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.87)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)
- North America > United States > Ohio (0.05)
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (13 more...)
- Research Report (0.68)
- Workflow (0.46)
- Government (0.67)
- Transportation > Passenger (0.46)
- Transportation > Air (0.46)
Procedural Environment Generation for Tool-Use Agents
Sullivan, Michael, Hartmann, Mareike, Koller, Alexander
Although the power of LLM tool-use agents has ignited a flurry of recent research in this area, the curation of tool-use training data remains an open problem$-$especially for online RL training. Existing approaches to synthetic tool-use data generation tend to be non-interactive, and/or non-compositional. We introduce RandomWorld, a pipeline for the procedural generation of interactive tools and compositional tool-use data. We show that models tuned via SFT and RL on synthetic RandomWorld data improve on a range of tool-use benchmarks, and set the new SoTA for two metrics on the NESTFUL dataset. Further experiments show that downstream performance scales with the amount of RandomWorld-generated training data, opening up the possibility of further improvement through the use of entirely synthetic data.
- Asia > Thailand > Bangkok > Bangkok (0.04)
- Asia > Indonesia > Java > Jakarta > Jakarta (0.04)
- North America > United States > Illinois > Sangamon County > Springfield (0.04)
- (4 more...)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Consumer Products & Services > Travel (0.93)
- Transportation > Passenger (0.68)
Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight
Tang, Jingyu, Chen, Chaoran, Li, Jiawen, Zhang, Zhiping, Guo, Bingcan, Khalilov, Ibrahim, Gebreegziabher, Simret Araya, Yao, Bingsheng, Wang, Dakuo, Ye, Yanfang, Li, Tianshi, Xiao, Ziang, Yao, Yaxing, Li, Toby Jia-Jun
The dark patterns, deceptive interface designs manipulating user behaviors, have been extensively studied for their effects on human decision-making and autonomy. Yet, with the rising prominence of LLM-powered GUI agents that automate tasks from high-level intents, understanding how dark patterns affect agents is increasingly important. We present a two-phase empirical study examining how agents, human participants, and human-AI teams respond to 16 types of dark patterns across diverse scenarios. Phase 1 highlights that agents often fail to recognize dark patterns, and even when aware, prioritize task completion over protective action. Phase 2 revealed divergent failure modes: humans succumb due to cognitive shortcuts and habitual compliance, while agents falter from procedural blind spots. Human oversight improved avoidance but introduced costs such as attentional tunneling and cognitive load. Our findings show neither humans nor agents are uniformly resilient, and collaboration introduces new vulnerabilities, suggesting design needs for transparency, adjustable autonomy, and oversight.
- Europe > Austria > Vienna (0.14)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- North America > United States > New York > New York County > New York City (0.06)
- (15 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.92)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (0.94)
- Law (0.92)
- Government (0.67)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- North America > United States > Ohio (0.04)
- Europe > Netherlands (0.04)
- (6 more...)
- Law (1.00)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
- Education (0.93)
- (2 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)
- Information Technology > Data Science > Data Mining (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)